201 
Prom 

Tadgl4 
Tryl 
Scce 
Heos 



Prom 
Tadgl4 
Tryl 
Scce 
Heps 



Prcm 
Tadgl4 
Tryl 
Scce 
Keps 



WVLTAAKC<K PNLQV. 

\m/TAAHC<K PKYTV. 

WWSAGHCxTK SRIQV. 

WVLTAAHCKM NEYTV. 
WVLTAAHC 



. F 
.R 
-R 
.H 

?P ERNRVLSRWR 



250 

LGKHNLRQRE SSQEQSSVVR AVIHPDV... 
LGDHSLQNKD GPEQEIPWO SIPHPCY... 
LGEHNIEVLE GNEQFINAAK IIRHPQY... 
LGSDTLGDRR A..QRIKASK SFRHPGY... 
VFAGAVAQAS PHGLQLGVQA VVyHGCYLPF 



DIMLLRLARP 
DLMLL2LRDQ 



AKLSELIQPL PLERDCSA. 
ASLGSKVKPI SLADHCTQ. 
AVINARVSTI SLPTAPPA. 
ARLSSMVKKV RLPSRCEP. 



DIMLIKLSSR 
DLMLV KLNSQ 

DIALV4LSSP LPLTEYIQPV CLPAAGQALV DGKICTVTGW 



251 

. . . DAASHDQ 
NSSDVEDHNH 

. . . DRKTLNN 
ST. . .QTHVN 
RDPNSEENSN 

301 

GKTAD..GDF PDTIQCAYIH LVSREECEHA 
..GTVTSPRENF PDTLNCAEVK IFPQKKCEDA 
GNTASSGADY PDELQCLDAP VLSQAKCEAS 
GTTTSPDVTF PSDLMCVDVK LISPQDCTKV 



300 

NTTSCHILGW 
PGQNCTVSGW 
TGTKCLISGW 
PGTTCTVSGW 



350 

.YPGQITQN MLCAGDEKYG 
.YPGQITDG MVCAGSSK.G 
.YPGKITSH MFCVGFLEGG 
.YKDLLENS MLCAGXPDSK 
GNTQYYGQQ. AGVLQEARVP IISNDVCNGA DFYGNQIKPK MFCAGYPEGG 



Prom 
TadglA 
Tryl 
Scce 
Keps 



351 . 

KDSCdGDSGG 
ADTCCGDSGG 
KDSCCGDSGG 
KNACNGDSGG 
IDACC3DS££i^ 



S661 XO KJc?- 1 



? 



Figure 



2, 



Comparison of amino acid sequence pf 
TADG-14 with known serine protease catalytic 
domains • 



102 7 CAGAACAAGTGTGAGAGAGCCTATCCAGGGAAGATCACCGAGGGCATGGT 1076 

lllli llllillll II II II III lllllll II llllllli 

1056 CAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACAGATGGCATGGT 1105 

• • « • * 
1077 CTGTGCTGGCAGCAGCAATGGAGCTGACACGTGCCAGGGTGACTCAGGAG 1126 

llllli lllllllllll M IIIMIIIIIIIIIIII II II III! 

1106 CTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAG 1155 

, • • • • 

1127 GCCCTCTGGTGTGCGACGGGATGCTCCAGGGCATCACCTCATGGGGCTCA 1176 

MM MIIIMI II II IIIIIIIMIIIII II Mlllllll 

1156 GCCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCA 1205 

• • • • • 
1177 GACCCCTGTGGGAAACCCGAGAAJVCCTGGAGTCTACACCAAAATCTGCCG 12 26 

lllllllllllll MM IIIMIII Mill Mill ilMilll 

1206 GACCCCTGTGGGAGGTCCGACAAACCTGGCGTCTATACCAACATCTGCCG 12 55 

• ■ • " 

1227 CTACACTACCTGGATCAAGAAGACCATGGACAACAGGGACTGATCCTGG 127 5 SEQ 10 § 

- . MM IIIIIIMIMIII III I II II II Mill II I 

12 5 6 CTACCTGGACTGGATCAAGAAGATCATAGGCAGCAAGGGCTGATTCTAG 1304 btO^-^-U^t^- I 



Percent Similarity: 77.220 Percent Identity: 72.201 

Match display thresholds for the alignment (s): 

I = IDENTITY . .* 

: = 2 
. = 1 

T14pro.Jack x Neur.Jack May 8, 1997 09:27 

• • • • • 
TadQl4 1 MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAAL 50 

nil II . I- III Mill -III hi I II iiiiiiiiii 

Neurop 1 MGRPPPCAIQPWILLLLFMGAWAGLTRAQGSKILEGRECIPHSQPWQAAL 50 

• • • • • 

51 FQGQQLLCGGVLVGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPV 100 

llh-hlllllli llllllllll IMIIIIIIII-:| Mill I 
51 FQGERLICGGVLVGDRWVLTAAHCKKQKYSVRLGDHSLQSRDQPEQEIQV 100 

• • • • • 

101 VQSIPHPCYNSSDVEDHNHDLMLLQLRDQASLGSKVKPISLADHCTQPGQ 150 

III MUM- IIMhIh-l-- Ml IMh M- I • II 

101 AQSIQHPCYNNSNPEDHSHDIMLIRLQNSANLGDKVKPVQLANLCPKVGQ 150 

♦ • • • 

151 KCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCEDAYPGQITDGMVCAG 200 

II HIIIIIIIMIII-IIIIIIIM: I III IIIMhlMMI 

151 KCIISGWGTVTSPQENFPNTLNCAEVKIYSQNKCERAYPGKITEGMVCAG 200 

• • • • • 
201 SSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRYLD 250 

II lllllllllllllllll IIIIIIIMIIM: HIIIII MM 

201 SSNGADTCQGDSGGPLVCDGMLQGITSWGSDPCGKPEKPGVYTKICRYTT 250 

251 WIKKIIGSKG 260 S£Q XO 7 

I I I I • * • 
251 WIKKTMDNRD 260 SPQ, AJO' 10 



Figure 8. Amino acid homology of TADG-14 with mouse neuropsin. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) 
(ii) 
(iii) 
(iv) 




APPLICANT: O'Brien et al 

TITLE OF INVENTION: Novel Extracellular Serine/Protease 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Benjamin Aaron Adlej/Ph.D. J.D. 

(B) STREET: 8011 Candle Lane 

(C) CITY: Houston 

(D) STATE: Texas 

(E) ZIP: 77071 
COMPUTER READABLE F0| 

(A) MEDIUM TYPE: /aA Mb floppy disk 

(B) COMPUTER: ^ple Macintosh 

(C) OPERATINCySYSTEM: Macintosh 

(D) SOFTWARE: Microsoft Word for Macintosh 
CURRENT APPLICATION DATA: 

(A) APPL^ATION NUMBER: 08/915,659 

(B) FILING DATE: August 21, 1997 

(C) i2!LASSIFICATI0N: 435 
PRI0R APPLICATION DATE: 
(^) APPLICATION NUMBER: 
(B) FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

(A) NAME: Benjamin Aaron Adler, Ph.D. 

(B) REGISTRATION NUMBER: 35,423 

(C) REFERENCE/DOCKET NUMBER: D6020 
TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (713) 777-2321 



(B) TELEFAX: (713) 777-6908 




(2) INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(A) DESCRIPTION: prot/m 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
( V ) FRAGMENT TYPE: /internal 

(vi) ORIGINAL SOURC 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN/<jENOME: 

(ix) FEATURE:, 

(x) PUBLICATION INFORMATION: 

(xi) SEQU^CE DESCRIPTION: SEQ ID NO. 

Trp Val Leu Thr Alac Ala His Cys Lys Lys Pro Asn Leu 

5/ 10 



1 

Gin Val 
15 



Phe Leu Glu Ly§/His Asn Leu Arg Gin Arg Glu Ser Ser Gin Glu 

20 25 30 

Gin Ser Ser/Val Val Arg Ala Val lie His Pro Asp Tyr Asp Ala 

35 40 45 

Ala Ser ^is Asp Gin Asp lie Met Leu Leu Arg Leu Ala Arg Pro 

50 55 60 



Ala Lvs Leu Ser Glu Leu lie Gin Pro Leu Pro Leu Glu Arg Asp 

65 70 75 

Cy/ Ser Ala Asn Thr Thr Ser Cys His lie Leu Gly Trp Gly Lys 

80 85 90 



Thr Ala Asp Gly Asp Phe Pro Asp Thr lie Gin Cys Ala Tyr lye 

95 100 



His Leu Val Ser Arg Glu Glu Cys Glu His Ala Tyr Pro 

110 115 

lie Thr Gin Asn Met Leu Cys Ala Gin Asp Glu Lys Tyr^ 

125 130 



Lys 
135 



Asp Ser Cys Gin Gly Asp Ser Gly Gly 

140 

(3) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS/ 

(A) LENGTH: 148 amino/acids 

(B) TYPE: amino acidy 

(C) STRANDEDNESS; 

(D) TOPOLOGY: jdnear 

(ii) MOLECULE TYPE:, 
(A) DESCRIP;riON: protein 

(iii) HYPOTHETICAL: no 

(iv) ANTISENS^ no 

(v) FRAGMENT TYPE: internal 

( vi ) ORIGMAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) PpsmON IN GENOME: 

(ix) /FEATURE: 

(x) / PUBLICATION INFORMATION: 
ii) SEQUENCE DESCRIPTION: SEQ ID NO.: 2 

Trp Val V^l Thr Ala Ala His Cys Lys Lys Pro Lys Tyr Thr Val 

5 10 15 

Arg Lei/ Gly Asp His Ser Leu Gin Asn Lys Asp Gly Pro Glu Gin 

20 25 30 

Glu /Lie Pro Val Val Gin Ser lie Pro His Pro Cys Tyr Asn Ser 

35 40 45 

S;^r Asp Val Glu Asp His Asn His Asp Leu Met Leu Leu Gin Leu 

50 55 60 



Arg Asp Gin Ala Ser Leu Gly Ser Lys Val Lys Pro lie Ser Leu 

65 70 75 

Ala Asp His Cys Thr Gin Pro Gly Gin Asn Cys Thr Val Ser uay 

80 85 /90 

Trp Gly Thr Val Thr Ser Pro Arg Glu Asn Phe Pro Asp THr Leu 

95 100 / 105 

Asn Cys Ala Glu Val Lys lie Phe Pro Gin Lys Lys o/s Glu Asp 

110 115 / 120 

Ala Tyr Pro Gly Gin lie Thr Asp Gly Met Val C/s Ala Gly Ser 

125 130 / 135 

Ser Lys Gly Ala Asp Thr Cys Gin Gly Asp S^r Gly Gly 

140 145 

(4) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTHdSTICS: 

(A) LENGTH: 14^ amino acids 

(B) TYPE: amijfo acid 

(C) STRANDgC)NESS: 

(D) TOPOUDGY: linear 

(ii) MOLECIMTYPE: 
(A) I^^SCRIPTION: protein 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(v) FRAGMENT TYPE: internal 

(vi) / ORIGINAL SOURCE: 
(Wi) IMMEDIATE SOURCE: 

viii) POSITION IN GENOME: 

(ix) FEATURE: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 3 

Trp //al Val Ser Ala Gly His Cys Tyr Lys Ser Arg lie Gin Val 

5 10 15 




Arg Leu Gly Glu His Asn lie Glu Val Leu 

20 25 

Phe lie Asn Ala Ala Lys lie lie Arg His 

35 40 

Lys Thr Leu Asn Asn Asp lie Met Leu lie 

50 55 

Ala Val lie Asn Ala Arg Val Ser Thr lie 

65 70 

Pro Pro Ala Thr Gly Thr Lys Cys Leu lie 

80 85 

Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp 

95 100 

Asp Ala Pro Val Leu Ser Gin Ala Lys Cys 

110 Hi 

Gly Lys lie Thr Ser Asn Met Phe C 

125 

Gly Lys Asp Ser Cys Gin Gly Asp S^r Gly 

140 / 145 



(5) INFORMATION FOR SEQ ID NO: 4 

(i ) SEQUENCE CH><RACTERISTICS : 

(A) LENG^^H: 144 amino acids 

(B) TYDE: amino acid 

(C) SJRANDEDNESS: 

(D) /TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
0 DESCRIPTION: protein 

(iii) / HYPOTHETICAL: no 
{iy/) ANTISENSE: no 

') FRAGMENT TYPE: internal 
( V i ) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

(ix) FEATURE: 



Glu Gly Asn 
Pro Gin Tyr 
Lys Leu Ser 
Ser Leu Pro 



Glu Gin 
30 

Asp Arg^ 
45, 



Gly Asn 
90 

Cys Leu 
105 

Tyr Pro 
120 

Glu Gly 
135 



(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQIDNO.: 4 

Trp Val Leu Thr Ala Ala His Cys Lys Met Asn Glu Tyr Tj/r Val 

5 10 / 15 

His Leu Gly Ser Asp Thr Leu Gly Asp Arg Arg Ala Glfa Arg lie 

20 25 / 30 

Lys Ala Ser Lys Ser Phe Arg His Pro Gly Tyr S^^^ Thr Gin Thr 

35 40 / 45 

His Val Asn Asp Leu Met Leu Val Lys Leu Asn Ser Gin Ala Arg 

50 55 / 60 

Leu Ser Ser Met Val Lys Lys Val Arg Lei/ Pro Ser Arg Cys Glu 

65 IJb 75 

Pro Pro Gly Thr Thr Cys Thr Val Sei/ Gly Trp Gly Thr Thr Thr 

80 / 85 90 

Ser Pro Asp Val Thr Phe Pro Ser/Asp Leu Met Cys Val Asp Val 

95 / 100 105 

Lys Leu lie Ser Pro Gin Asp/Cys Thr Lys Val Tyr Lys Asp Leu 

110 / 115 120 

Leu Glu Asn Ser Met Leu /;ys Ala Gly lie Pro Asp Ser Lys Lys 



125 / 130 

Asn Ala Cys Asn Gly ^p Ser Gly Gly 

140 

(6) INFORMATION /or SEQ ID NO: 5 

(i) /SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(A) DESCRIPTION: protein 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(v) FRAGMENT TYPE: internal 



135 



(vi) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

(ix) FEATURE: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQIDMO.: 5 

Trp Val Leu Thr Ala Ala His Cys Phe Pro Glu Arg Asn Arg Val 

5 10 / 15 

Leu Ser Arg Trp Arg Val Phe Ala Gly Ala Val Ala Gin Ala Ser 

20 25 / 30 

Pro His Gly Leu Gin Leu Gly Val Gin Al/ Val Val Tyr His Gly 

35 j/t 45 

Gly Tyr Leu Pro Phe Arg Asp Pro Asia Ser Glu Glu Asn Ser Asn 

50 / 55 60 

Asp lie Ala Leu Val His Leu Ser/ Ser Pro Leu Pro Leu Thr Glu 

65 / 70 75 

Tyr lie Gin Pro Val Cys Leu /Pro Ala Ala Gly Gin Ala Leu Val 

80 / 85 90 

Asp Gly Lys lie Cys Thr Val Thr Gly Trp Gly Asn Thr Gin Tyr 

95 / 100 105 

Tyr Gly Gin Gin Ala Gly Val Leu Gin Glu Ala Arg Val Pro lie 

110 / 115 120 

lie Ser Asn Asp Va/ Cys Asn Gly Ala Asp Phe Tyr Gly Asn Gin 

1/5 130 135 

lie Lys Pro Lys /Met Phe Cys Ala Gly Tyr Pro Glu Gly Gly lie 

140 145 150 

Asp Ala Cys 9ln Gly Asp Ser Gly Gly 

155 

(7) INFOiyClATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single-stranded 




(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: / 
(A) DESCRIPTION: cDNA to mRNA / 

(iii) HYPOTHETICAL: no / 

(iv) ANTISENSE: no / 
( V ) FRAGMENT TYPE: / 

(vi) ORIGINAL SOURCE: / 

(vii) IMMEDIATE SOURCE: / 

(viii) POSITION IN GENOME: / 

(ix) FEATURE: / 

(x) PUBLICATION INFORmXtION: 

(xi) SEQUENCE DESCRIPnON: SEQIDNO.: 6 

CTGTAGCAGG CAGAGCTTAC CAAGTCTCTjZ CGAACTCAAA TGGAAGAAAT ACCTTATGAA 60 

TGTAAGAATG TAGGGGGTCA TGGCTTGm?^ TTTACACAGT GTAAATGAAA CCATCCTAGA 12 0 

GGATTATGAG GAATCCTTTC TATGTG^TT TCAATCATAG CAAGCAAGAA AGGCTCCAGT 180 

GTCAAGGTAG TTCAGCTCTT ACAGOCTATA AAACAGTCCA TACTTGAGAG AAAAAACTTA 240 

GATCTGAGTG ATGGAATGTG AAGC;6aTCT TTCAAAATCA GTAGACATTT CTTGGACATA 3 00 

AAACACAGAT GAGGAAAGGG CTTCAAATTA GAAGTTACGT AATCACCATC AGAAAGTTCA 3 60 

TGTTTGGTAA ATTCTGTTAC Ti^AAATGTA GGAAATTCAG GTATAGCTTT GAATCCCAAT 42 0 

TACACATTGG TCAGTGGGAA id^CTAAGGGC CTCCAACAGG CAAATTCAGG GAGGATAGGT 480 

TTCAGGGAAT GCCCTGGATyCTGGAAGACC TCACCATGGG ACGCCCCCGA CCTCGTGCGG 540 

CCAAGACGTG GATGTTCCTG CTCTTGCTGG GGGGAGCCTG GGCAGGACAC TCCAGGGCAC 600 

AGGAGGACAA GGTGCTGGGG GGTCATGAGT GCCAACCCCA TTCGCAGCCT TGGCAGGCGG 660 

CCTTGTTCCA GGGCCA(^AA CTACTCTGTG GCGGTGTCCT TGTAGGTGGC AACTGGGTCC 72 0 

TTACAGCTGC CCACT^AAA AAACCGAAAT ACACAGTACG CCTGGGAGAC CACAGCCTAC 7 80 

AGAATAAAGA TGGOCCAGAG CAAGAAATAC CTGTGGTTCA GTCCATCCCA CACCCCTGCT 840 

ACAACAGCAG CGA^TGTGGAG GACCACAACC ATGATCTGAT GCTTCTTCAA CTGCGTGACC 900 

AGGCATCCCT G(^GTCCAAA GTGAAGCCCA TCAGCCTGGC AGATCATTGC ACCCAGCCTG 960 

GCCAGAAGTG GACCGTCTCA GGCTGGGGCA CTGTCACCAG TCCCCGAGAG AATTTTCCTG 102 0 

ACACTCTCAA yCTGTGCAGAA GTAAAAATCT TTCCCCAGAA GAAGTGTGAG GATGCTTACC 1080 

CGGGGCAGAy CACAGATGGC ATGGTCTGTG CAGGCAGCAG CAAAGGGGCT GACACGTGCC 1140 

AGGGCGATTC TGGAGGCCCC CTGGTGTGTG ATGGTGCACT CCAGGGCATC ACATCCTGGG 12 00 

GCTCAGAGCC CTGTGGGAGG TCCGACAAAC CTGGCGTCTA TACCAACATC TGCCGCTACC 1260 

TGGACTGGAT CAAGAAGATC ATAGGCAGCA AGGGCTGATT CTAGGATAAG CACTAGATCT 1320 

CCCTTAATAA ACTCACGGAA TTC 1343 

( 8 ) INFORMATION FOR SEQ ID NO: 7 

/ (i) SEQUENCE CHARACTERISTICS: 

/ (A) LENGTH: 260 amino acids 



(B) TYPE: amino acid 




(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(A) DESCRIPTION: protein 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

(ix) FEATURE: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTIONy SEQIDNO.: 7 

Met Gly Arg Pro Arg Pro Arg Ala Ala/Lys Thr Trp Met Phe Leu 

5 / 10 15 

Leu Leu Leu Gly Gly Ala Trp Ala /6ly His Ser Arg Ala Gin Glu 

20 / 25 30 

Asp Lys Val Leu Gly Gly His !Slu Cys Gin Pro His Ser Gin Pro 

35 / 40 45 

Trp Gin Ala Ala Leu Phe Gin Gly Gin Gin Leu Leu Cys Gly Gly 

50 / 55 60 

Val Leu Val Gly Gly A^n Trp Val Leu Thr Ala Ala His Cys Lys 

65 / 70 75 

Lys Pro Lys Tyr Th^ Val Arg Leu Gly Asp His Ser Leu Gin Asn 

8jf 85 90 

Lys Asp Gly Prc/Glu Gin Glu lie Pro Val Val Gin Ser lie Pro 

95 100 105 

His Pro Cys /tyr Asn Ser Ser Asp Val Glu Asp His Asn His Asp 

110 115 120 

Leu Met ^u Leu Gin Leu Arg Asp Gin Ala Ser Leu Gly Ser Lys 

125 130 135 

Val li}/s Pro lie Ser Leu Ala Asp His Cys Thr Gin Pro Gly Gin 

140 145 150 




Arg Glu 
165 

Phe Pro, 
181 

Asp /Gly 
195 

In Gly 
210 

Gly He 
225 

Pro Gly 
240 

Lys He 
255 




Lys Cys Thr Val Ser Gly Trp Gly Thr Val Thr Ser Pro 

155 160 

Asn Phe Pro Asp Thr Leu Asn Cys Ala Glu Val Lys He 

170 175 

Gin Lys Lys Cys Glu Asp Ala Tyr Pro Gly Gin He Thr 

185 190 

Met Val Cys Als Gly Ser Ser Lys Gly Ala Asp Thr Cys 

200 205 

Asp Ser Gly Gly Pro Leu Val Cys Asp Gly Ala Leu /6ln 

215 220 

Thr Ser Trp Gly Ser Asp Pro Cys Gly Arg Ser y^sp Lys 

230 235 

Val Tyr Thr Asn He Cys Arg Tyr Leu Asp jfrp He Lys 

245 25( 

He Gly Ser Lys Gly 

260 

(9) INFORMATION FOR SEQ ID NO: 8 

SEQUENCE CHARMTERISTICS: 

(A) LENGTH: /799 bp 

(B) TYPE: yJiucleic acid 

(C) STRANDEDNESS: single-stranded 

(D) TQTOLOGY: linear 
MOLEfZXJLETYPE: 

(A)/ DESCRIPTION: other nucleic acid 
TOTHETICAL: no 
(iv) /ANTISENSE: no 
FRAGMENT TYPE: 
ri ) ORIGINAL SOURCE: 

(vii) IMMEDIATE SOURCE: 

(viii) POSITION IN GENOME: 

(ix) FEATURE: 

(x) PUBLICATION INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 




AGAGGCCACC ATGGGACGCC CCCCACCCTG TGCAATCCAG CCGTGGATCC TTCTGCTTCT 60 

GTTCATGGGA GCGTGGGCAG GGCTCACCAG AGCTCAGGGC TCCAAGATCC TGGAAGGTCG YljH 

AGAGTGTATA CCCCACTCCC AGCCTTGGCA GGCAGCCTTG TTCCAGGGCG AGAGACTGAT L80 

CTGTGGGGGT GTCCTGGTTG GAGACAGATG GGTCCTCACG GCAGCCCACT Q:Q.'M',M<hh<ZK A AO 
GAAGTACTCC GTGCGTCTGG GTGATCATAG CCTCCAGAGC AGAGATCAGC CGGAGCAGGiv/ 3 00 

GATCCAGGTG GCTCAGTCTA TCCAGCATCC TTGCTACAAC AACAGCAACC CAGAAGATGA 360 

CAGTCACGAT ATAATGCTCA TTCGACTGCA GAACTCAGCA AACCTCGGGG ACAAGGTJZAA 420 

GCCGGTCCAA CTGGCCAATC TGTGTCCCAA AGTTGGCCAG AAGTGCATCA TATCAGGCTG 480 

GGGCACTGTC ACCAGCCCTC AAGAGAACTT TCCAAACACC CTCAACTGTG CGGA^TGAA 540 

AATCTATTCC CAGAACAAGT GTGAGAGAGC CTATCCAGGG AAGATCACCG AGGj^ATGGT 600 

CTGTGCTGGC AGCAGCAATG GAGCTGACAC GTGCCAGGGT GACTCAGGAG GQCCTCTGGT 660 

GTGCGACGGG ATGCTCCAGG GCATCACCTC ATGGGGCTCA GACCCCTGTG l^AAACCCGA 720 

GAAACCTGGA GTCTACACCA AAATCTGCCG CTACACTACC TGGATCAAG^AGACCATGGA 780 

CAACAGGGAC TGATCCTGG X 799 

(lO)INFORMATIONFORSEQIDNO: 9 / 

(i) SEQUENCE CHARACTERISTICS: / 

(A) LENGTH: 799 bp / 

(B) TYPE: nucleic acid / 

(C) STRANDEDNESS: sihgle- stranded 

(D) TOPOLOGY: linea/ 

(ii) MOLECULE TYPE: / 

(A) DESCRIPTION: cDNA to mRNA 

(iii) HYPOTHETICAL no 

(iv) ANTISENSe/ 10 

( V ) FRAGMe<T TYPE: 

(vi) ORIGJ^fAL SOURCE: 

(vii) mmEDlATESOmCE: 

(viii) ^SITION IN GENOME: 
(ix/ FEATURE: 

J^) PUBLICATION INFORMATION: 

/ (xi) SEQUENCE DESCRIPTION: SEQIDNO.: 9 

AGACCT(Z!aCC ATGGGACGCC CCCGACCTCG TGCGGCCAAG ACGTGGATGT TCCTGCTCTT 60 

GCTGa^GGA GCGTGGGCAG GACACTCCAG GGCACAGGAG GACAAGGTGC TGGGGGGTCA 120 

TGACTGCCAA CCCCATTCGC AGCCTTGGCA GGCGGCCTTG TTCCAGGGCC AGCAACTACT 180 

CTOTGGCGGT GTCCTTGTAG GTGGCAACTG GGTCCTTACA GCTGCCCACT GTAAAAAACC 240 

GA^TACACA GTACGCCTGG GAGACCACAG CCTACAGAAT AAAGATGGCC CAGAGCAAGA 300 

^>&ACCTGTG GTTCAGTCCA TCCCACACCC CTGCTACAAC AGCAGCGATG TGGAGGACCA 3 60 

'CAACCATGAT CTGATGCTTC TTCAACTGCG TGACCAGGCA TCCCTGGGGT CCAAAGTGAA 420 



GCCCATCAGC 
GGGCACTGTC 
AATCTTTCCC 
CTGTGCAGGC 
GTGTGATGGT 
CAAACCTGGC 
CAGCAAGGGC 



CTGGCAGATC 
ACCAGTCCCC 
CAGAAGAAGT 
AGCAGCAAAG 
GCACTCCAGG 
GTCTATACCA 
TGATTCTAG 



ATTGCACCCA 
GAGAGAATTT 
GTGAGGATGC 
GGGCTGACAC 
GCATCACATC 
ACATCTGCCG 



GCCTGGCCAG 
TCCTGACACT 
TTACCCGGGG 
GTGCCAGGGC 
CTGGGGCTCA 
CTACCTGGAC 



AAGTGCACCG 
CTCAACTGTG 
CAGATCACAG 
GATTCTGGAG 
GACCCCTGTG 
TGGATCAAGA 



TCTCAGGjZTG 480 

CAGAAGJTAAA 540 

ATGGQATGGT 600 

GCCQCCTGGT 660 

GG;>K5GTCCGA 720 

SATCATAGG 780 
799 




( 1 1 )INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(A) DESCRIPTION: /protein 

(iii) HYPOTHETICAL: 

(iv) ANTISENSE: no/ 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL/SOURCE: 

(vii) IMMEDJATE SOURCE: 

(viii) POSmON IN GENOME: 

(ix) FEATURE: 

( X ) yl^UBLIC ATION INFORMATION: 

(xi/ SEQUENCE DESCRIPTION: SEQ ID NO.: 10 

Met Gly Arg/Pro Pro Pro Cys Ala lie Gin Pro Trp lie Leu Leu 

5 10 15 

Leu Leu /§he Met Gly Ala Trp Ala Gly Leu Thr Arg Ala Gin Gly 

20 25 30 

Ser J^ys lie Leu Glu Gly Arg Glu Cys lie Pro His Ser Gin Pro 

35 40 45 

t/p Gin Ala Ala Leu Phe Gin Gly Glu Arg Leu lie Cys Gly Gly 

50 55 60 



Val Leu Val Gly Asp Arg Trp Val Leu Thr Ala Ala His Cys Lys 

65 70 75 



Lys Gin Lys Tyr Ser Val Arg Leu Gly Asp His Ser Leu Gin Sdr 

80 85 y90 

Arg Asp Gin Pro Glu Gin Glu lie Gin Val Ala Gin Ser Ide Gin 

95 100 / 105 



His Pro Cys Tyr Asn Asn Ser Asn Pro Glu Asp His 

110 115 



^er 



His Asp 
120 



lie Met Leu lie Arg Leu Gin Asn Ser Ala Asn yLeu Gly Asp Lys 

125 130 / 135 

Val Lys Pro Val Gin Leu Ala Asn Leu Cys /Pro Lys Val Gly Gin 

140 145 150 

Lys Cys lie lie Ser Gly Trp Gly Thr/Val Thr Ser Pro Gin Glu 

155 / 160 165 

Asn Phe Pro Asn Thr Leu Asn Cys/Ala Glu Val Lys lie Tyr Ser 

170 / 175 180 

Gin Asn Lys Cys Glu Arg Ala/Tyr Pro Gly Lys lie Thr Glu Gly 

185 / 190 195 

Met Val Cys Ala Gly Ser/ser Asn Gly Ala Asp Thr Cys Gin Gly 

200 / 205 210 

Asp Ser Gly Gly Prj/ Leu Val Cys Asp Gly Met Leu Gin Gly lie 

5 220 225 

Thr Ser Trp G]^ Ser Asp Pro Cys Gly Lys Pro Glu Lys Pro Gly 

230 235 240 



Val Tyr Tl>r Lys lie Cys Arg Tyr Thr Thr Trp lie Lys Lys Thr 

245 250 255 



Met a^ Asn Arg 



Asp 
260 



